Semi-Supervised Learning on Graphs through Reach and Distance Diffusion

Author

Edith Cohen

Abstract

Semi-supervised learning algorithms are an indispensable tool when labeled examples are scarce and unlabeled examples are plentiful [Blum and Chawla 2001, Zhu et al. 2003]. With graph-based methods, entities (examples) correspond to nodes in a graph, and edges connect related entities. The graph structure is used to infer implicit pairwise affinity values (a kernel), which are then used to compute the learned labels. Two powerful techniques for defining such a kernel are “symmetric” spectral methods and Personalized PageRank (PPR). With spectral methods, labels can be learned scalably using Jacobi iterations, but an inherent limitation is that they apply only to symmetric (undirected) graphs, whereas relations between entities, such as likes, follows, or hyperlinks, are often inherently asymmetric. PPR works naturally with directed graphs but, even with state-of-the-art techniques, does not scale when we want to learn billions of labels. Aiming at both high scalability and the handling of directed relations, we propose Reach Diffusion and Distance Diffusion kernels. Our design is inspired by models for influence diffusion in social networks, which were formalized in, and spawned by, the seminal work of [Kempe, Kleinberg, and Tardos 2003]. We tailor these models to define a natural asymmetric “kernel” and design highly scalable algorithms for parameter setting and label learning.
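
To make the spectral baseline mentioned in the abstract more concrete, the following is a minimal sketch of label learning by Jacobi iterations on a small undirected graph. It is not the Reach Diffusion or Distance Diffusion method proposed in the paper, whose definitions are not given in this abstract. The function name `jacobi_label_propagation`, the toy path graph, and the iteration count are illustrative assumptions.

```python
def jacobi_label_propagation(adj, seeds, iterations=200):
    """Propagate seed labels over an undirected weighted graph.

    adj:   dict mapping node -> {neighbor: weight}; assumed symmetric.
    seeds: dict mapping labeled node -> label score (kept fixed).
    Returns a dict mapping every node -> learned score.
    """
    # Unlabeled nodes start at 0; labeled nodes start at their seed value.
    scores = {v: seeds.get(v, 0.0) for v in adj}
    for _ in range(iterations):
        new_scores = {}
        for v, nbrs in adj.items():
            if v in seeds:
                # Seed labels are clamped in every sweep.
                new_scores[v] = seeds[v]
                continue
            total = sum(nbrs.values())
            if total == 0:
                # Isolated node: nothing to average, keep its score.
                new_scores[v] = scores[v]
            else:
                # Jacobi update: weighted average of the PREVIOUS sweep's
                # neighbor scores (no in-place updates).
                new_scores[v] = sum(w * scores[u] for u, w in nbrs.items()) / total
        scores = new_scores
    return scores


# Hypothetical toy graph: the path a - b - c - d with seeds a = 1 and d = 0.
graph = {
    "a": {"b": 1.0},
    "b": {"a": 1.0, "c": 1.0},
    "c": {"b": 1.0, "d": 1.0},
    "d": {"c": 1.0},
}
result = jacobi_label_propagation(graph, {"a": 1.0, "d": 0.0})
```

Each sweep visits every edge a constant number of times, which is what makes this style of iteration attractive at scale. The spectral formulation behind it assumes symmetric edge weights, which is the limitation the abstract points to for directed relations.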

Similar resources

18.S096: Graphs, Diffusion Maps, and Semi-supervised Learning

These are lecture notes not in final form and will be continuously edited and/or corrected (as I am sure it contains many typos). Please let me know if you find any typo/mistake. Also, I am posting short descriptions of these notes (together with the open problems) on my Blog, see [Ban15]. Graphs will be one of the main objects of study through these lectures, it is time to introduce them. Grap...

Comparing Diffusion Models for Graph–Based Semi–Supervised Learning

The main idea behind graph-based semi-supervised learning is to use pairwise similarities between data instances to enhance classification accuracy (see (Zhu, 2005) for a survey of existing approaches). Many graph-based techniques use a certain type of regularization that often involves a graph Laplacian operator (e.g., see (Belkin et al., 2006)). Intuitively, this corresponds to a diffusion proc...

Composite Kernel Optimization in Semi-Supervised Metric

Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...

Adaptive Distance Metric Learning for Diffusion Tensor Image Segmentation

High quality segmentation of diffusion tensor images (DTI) is of key interest in biomedical research and clinical application. In previous studies, most efforts have been made to construct predefined metrics for different DTI segmentation tasks. These methods require adequate prior knowledge and tuning parameters. To overcome these disadvantages, we proposed to automatically learn an adaptive d...

Graph Based Microscopic Images Semi and Unsupervised Classification and Segmentation

In this paper, we propose a general formulation of discrete functional regularization on weighted graphs. This framework can be used on any multi-dimensional data living on graphs of arbitrary topologies. In this work, however, we focus on microscopic image segmentation and classification with semi-supervised and unsupervised schemes. Moreover, to provide a fast image segmentation we propose a graph ...

Publication date: 2016
